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AESTRACT 

Research on the Initial Teaching Alphabet (i.t.a.) 
is viewed frca two aspects--its limitations and how it can be 
improved. Thus far research has indicated that (1) transition from 
i.t.a. to traditional orthography is not the problem expected and (2) 
the children who are taught with i.t.a. read no better or worse than 
children taught with the conventional alphabet. Discussed are errors 
in research design related to (1) the use of control and experimental 
groups and materials used by each, the teachers, the investigator's 
attitude, sample size, and statisvical analyses used; (2) the 
duration of the investigation, which in many studies is but one year; 
and (3) problems with criterion measures centering around such issues 
as final measures for evaluating i.t.a., the adequacy of reading 
measures, and correlated measures. Suggestions were made for 
improving research design which can be implemented as further studies 
replicate or build upon vhat has been done. Further research studies 
with i.t.a. could lead not only to its more effective evaluation but 
also to an examination of basic concepts relating to aspects of 
educational reseatch. A bibliography is included. (DH) 
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A CRITIQUE OF RESEARCH 
WITH THE INITIAL TEACHING ALPHABET 
AND SOME RECOMMENDATIONS 



/. H* lUock Executive director t,f« a t'ouuiotion 
Hofstra Vnivcrsily* Hempstead* New York 




This is a lime of excitement in education. 
Innovations have come on the educational 
scene by the score and have hod considerable 
impact. Perhaps it Has Sputnik that made 
Americans aware that tve must advance as 
rapidly in education as in science. Perhaps 
it is only an effluent society* concerned with 
becoming a “great** one, that wc have both an 
awareness of the need for quality education 
and the capacity to provide it. The fact re- 
mains that wc are making an enormous effort 
to improve our educational effectiveness, tlut, 
rapid and exciting progress is rot without price. 
Each innovation intended to improve the effec- 
tiveness of education is met with great ac- 
ceptance by some, with strong rejection by 
others, and probably with cautious optimism by 
most. Educational television, for example, has 
experienced only limited success. Programmed 
instruction, too, has its cullists and its de- 
tractors. With both lochni^ies we anticipate 
some depersonali nation of the student-teacher 
relationship, yet recognize that our growing 
population demands some such solution. Un- 
doubtedly, the ultimate balance will involve 
intelligent and perhaps creative use of each, 
f Jut programmed instruction and educational 
television are, in the long run, merely devices 
which an educator may use to increase and ex- 
pand his effectiveness. 

The introduction of the Initial Teaching 
Alphabet has created similar controversies. 
It, loo, is a "device**. It suggests that one of 
the major causes for reading failure lies in 
the idiosyncrasies of our alphabet and spell- 
ings. It suggests further that, by introducing 
tempo! ary changes to make these more logical 
and consistent, we may develop not only better 
readers, bt\ better waiters as well. Perhaps 
most importantly, there is the hope that we 
May ircrease the lot* of reading and writing. 
The alphabet looks a little different. To some, 
thisisetciting: toothers, frightening. Although 
literate adults have no difficulty in reading in 
i.t.a., some are troubled by it. Commonly ex- 
pressed fears are: "How will a child be able 
toswitch to the conventional alphabet?** "How 
will he ever learn to spell?", not that w* can 



be particularly proud of how well our children 
spell when they learn to read using our con- 
ventional alphabet. 

Some of tne resistance to i.t.a* stems from 
genuine educational concern. Here, the fenr is 
that exposure of children to a temporary alpha- 
bet, no matter how carefully designed, may 
jeopardize their future reading ability, and 
since reading ability is so basic to the educa- 
tional process, thus may ultimately undermine 
their educational future. 

Someresistancestemssimply from resistance 
to change of rn> kind. There are many whose 
personality is such that change of any sort is 
threatening in and ol itself. 

Finally, thtre is resistance that arista out 
of commercial or professional interests. For 
many years, our approach to the teaching of 
reading has be.-n based largely on various 
“methods." The emphasis shifts from lime to 
time from essential!) a "look and say M ap- 
proach to a "phonic** approach, with a nurrJber 
of othe. variations. There are experts and pro- 
ponents of each with deep professional and 
commercial commitments. There ate also busi- 
ness organizations which have invested large 
sums of money in traditional texts and basal 
series designed to leach beginningor remedial 
reading. Obviously, for them a shift to a new 
alphabet would impose a considerable finan- 
cial burder. 

thile educators do not like to acknowledge 
the impact of the commercial organization upon 
educational policy, it is undeniably significant. 
A given afproach to education may succeed or 
fail in part as a reflection of the extent to 
which usable materials are available. Ibe 
Initial Teaching Alphabet has hed its prede- 
cessors in the form of ether transitional alpha- 
bets or spelling reform movements. Sir Isaac 
Pitman was instrumental in developing "Fono* 
tvpy" in the late 19th century. l^aler, in both 
the I920*s and I9l0*s, the simplified spelling 
movement was popular* All of these move- 
ments, while experiencing considerable suc- 
cess, were ultimately discarded. One of the 
major reasons appears to have been a lack of 
sufficient range aad quality of tomtertial ma- 
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terials (Harrison, I960- Perhaps i.t.a has 
become as controversial es it has because wc 
have a competitive commercial market today. 
Many corporations have become involved in 
this educational innovation. In Great Britain, 
over thirty-five commercial institutions are 
preparing materials in i.t.a., and at least 
twelve are known to be doing so in this coun- 
try. In addition to many hundreds of children’s 
books, major manufacturers have prepared 
typewriters, rubber stamps, flash cards, and 
aecal-type material. Printers have indicate 
that i.t.a. fonts are available, and major test 
publishers havt transliterated well-known and 
widely-used achievement tests into i.t.a. Some 
individuals have viewed the introduction of 
these commercial organizations as a threat to 
education. Others view it as providing the 
necessary materials for o comprehensive pro- 
gram so that i.t.a. may have a full and con- 
clusive evaluation. 

With all this activity, there has been peat 
interest in i.t.a. Over two hundred articles 
have been published in the mass media and 
professional journals describing i.t.a. or re- 
portingon its effectiveness. Vernon (1965) has 
suggested that this publicity has invalidated 
th i i.t.a. studies in England. It is my opinion 
that publicity both in this country and in 
Kngland was essential to gain sufficient co- 
operation from professionals and parents to 
permit evaluation of the effectiveness of this 
new medium. 

Kith all this activity, what has the research 
indicated thus far? Viewed from the most con- 
servative point of view, the research has 
shown that fears that there would be difficulty 
I. in transition and poor spelling have not been 

realized. Khilc there is a great risk in over- 
generalization, r*any of Ac studies (Alden 
and Manning, 1965; Baker, 1966; Bosnia and 
Farrow, 1965; Downing, 1965; Dunn, Muller 
and Neely, 1966; Jameson, 1965; Mazufkiewic* 
j 1961; Monteal, 1965; Myers, 1965; Regan, 

| 1965; Shrpiro, 1966; and Sloan, 1966) indicate 

| significant differences in reading achievement 

in favor of i.t.a. ggoups when corr^ared with 
controls, while a number of other studies 
(Downing and Jones, 1966; Hahn, 1966; Moun- 
tain *066; Nemeth and Hayes, 1966; Tanyter, 
1966b; and Tanyter, Alpert and Sandel, 1965) 
show no significant difference between i.t.a. 
taught students and control ffoups. 

Rven in the case of studies purporting to 
show no significant differences there is partial 
evidence for the success of i.t.a. Nemeth and 
Hayes <1966)for example, generally report that 
the i.t.a. group scored significantly higher 
than the basic T.O. controls but not always 



higher than a highly phonically-orientcd T.O. 
approach. In addition they indicate that their 
evidence suggested thatthe greatest advantage 
of i.t.a. seemed to be for children in the low est 
third of students ranked by I.Q. Similarly, 
McCracken (1966b) found that his i.t.a. group 
scored significantly higher than a T.O. control 
group on some of his measures but not cn all. 
Both the i.t.a. and the T.O. control group 
scored significantly higher than a svb-control 
group unaware that they were part of unexperi- 
ment. 

Thus far, I am not aware of any study which 
has presented evidence that children taught 
with the Initial Teaching Alphabet do generally 
less well than those taught with the conven- 
tional alphabet. Thus, criticism of the alpha- 
bet has shifted from a focus upon potential 
difficulties in transition and spelling (which 
are presumably no longer a serious concern) to 
suggestions that I.t.a. either is not better than 
our conventional approach or that, given an 
early advantage to the i.t.a. group, the per- 
sistence of the advantage is questionable 
(Downing, Rose and Cutts, 1961; Southgate, 
1965; and Vernon, 1965). 

It is important to note that virtually every 
study investigating the effectiveness of the 
Initiat Teaching Alphabet has contained one 
or more nrrioua methodological errors. Of 
course, this is probably true of most studies 
in the field of education in general, and more 
broadly, in all ar:as of applied research. In- 
vestigators in educational research do not have 
the opportunity for rigid laboratory controls. 
They do not use lower animals and concern 
themselves with simple variables. They in- 
vestigate inherently co.nplex material in a 
complex applied setting. Although technically 
desirable, it is not always possible, and, in- 
deed, at limes would be unethieal to present 
all combinations of one’s experimental varia- 
tions to persons who are learning to read. It 
would be difficult to obtain parental approval 
focmany theoretically interesting combinations 
of variables. Thus, we must rely upon the 
ethics of the profession and permit the weight 
of number of studies to compensate for the lack 
of complete control. 

Nonetheless, it might be useful to point out 
some errors in research design in studies of 
i.t.a, and to .suggest what mirfit be done to 
overcome or circumvent them. In some cases, 
the nature of the methodological error would 
be likely to reduce the apparent improvement 
achieved by the use of i.t.a. In other cases, 
the flaw in the design might bo expected to 
increase the apparent effecti\eness of i.t.a. 
In still other cases, it would be difficult to pre* 
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dicl the net effect upon the dependent van* 
able, but conclusions based on the studies 
would necessarily be extremely tenuous. 

Although there have been a great number cf 
specific methodological errors in the Lt.a. 
studies conducted thus far, they appear to fall 
into the three rather broad categories: errors 
in research design; the duration of investiga- 
lions; and problems associated with the cri- 
terion ncasures.. Each of these areas will be 
treated srpa.alcly with a number of specific 
problems discussed under each. Obviously, 
the many problems discussed in this paper 
are interrelated and any independent discus* 
sion of categories must necessarily be some- 
what arbitrary. 

/. Errors in Research Dosigt 

Problems associated with research design 
involve such factors as the **se of control 
grou; i and the ability to match them with the 
experimental groups for factors related to 
reading; the materials used by each? and, the 
teachers involved. Additional problems include 
elimination or control of the Hawthorne Kf fee l 
and the investigator's attitude, as well as 
considerations of sample si/e and the statis- 
tical analyses used. 

Control Grovps ~ Several studies have been 
conducted without benefit of control groups 
(Dcvercll, 1966: Georgiades and Downing, 
1961; Hastings, 1966; Kemp, 1966; Mason, 
1966; and Stevenson, 1966). Implicit in many 
of these is the suggestion that a control group 
does exist in the form of ^rior experience with 
many groups of students taught in a conven- 
tional manner, fchile these studies do not pre- 
sent statistic il evidence for or against i.t.a., 
they may be fruitful as pilot studies. The op- 
portunity they provide for direct personal ex- 
prtience may yield a greate. sense of validity 
for many teachers than abstract sets of statis- 
tical tables. 

0 her studi® have made efforts to makh 
individuals or groups on such variables es 
age, sex, I.Q., and reading readiness* (Down- 
ing, 1965; Uaiurkiewict, 1966a; M)ers, 1965; 
and Sloan, 1966). Jhilt these would seem *o 
be relevant factors for matching, a nunher of 
the studies have been forced to resoit to match- 
ing classes in two different schools. For ex- 
ample, Bosma and Farrow (1965) used students 



in a campus school matched with those in a 
neighboring public school district. While the 
authors state they have reason to believe the 
two schools are comparable and the subjects 
arc matched on the demographic data noted 
obove, the environn.ent of a campus school 
might be expected to be quite different from a 
public school setting. As i.t.a. becomes in- 
creasingly accepted, it should be possible to 
replicate the research conducted thus far usin^ 
more adequately matched groups. 

In addition, there is the question of the ways 
in which the control groups should differ from 
the experimental groups. Some investigators 
have suggested that only the media be changed 
and that the content of the material remain con- 
stant (Downing, 1965). There is a marked 
methodological advantage in th is procedure 
since one changes only a single independent 
variable at a time and may more adequate!) 
identify the specific factor to whichany change 
may be attributed. On the other hand, it can 
be argued that one virtue of i.t.a. is that it 
permits an author to relate his materials more 
directly to the interest and sophistication 
levels of his audience, since he »s no longer 
concerned with a rigidl* controlled vocabulary 
as a means of handling the inconsistencies of 
written Knglish. A study presenting the \ocab* 
ulary of an i.t.a. series with its tesr rigid vo- 
cabulary control to beginning readers In T.O. 
represents a combination of variables which 
one would hesitate to manipulate in fear of 
doing irreparable damage to the child's ability 
and attitude toward reading. 

Nonetheless, if i.t.a. may be used to pre- 
sent more challenging materials, then its ef- 
fectiveness in this approach must be evalua- 
ted. Further, one may argue that there are 
several ways to control the idiosy ncracies of 
Knglish. One such method is to present a 
highly controlled vocabulary so that these 
idiosyncrasies may be introduced gradually 
and systematically. Another method is to 
provide a transitional alphabet to eliminate 
problems initially and at the same lime pro- 
vide mere satisfying reading material. The 
idiosyncrasies of wtilten Knglish may be 
gradually and systematically introduced as 
the transition takes place* If vocabulary ton* 
trol reduces the complexities of wtilten Kng- 



I woald raise the question of the utility of match- 
ing on the hast* cf reading readiness measures. 
Kirn, ire do net usually use measures of jpei&- 
ing, listening or writing readiness. Second, such 



measures are highly correlated with I.Q. Finally, 
there is an iepiictl * sumption In the tons* net ion 
of the measure that the child will he reading in 
T.O. i.t.au may iuw&lve substantially dHfetent 
skills and p ft cep tail abilities. 
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lish, then there would be an advantage in 
superimposing a second control of a transi- 
lional alphabet only if vocabulary' control is 
not completely effective, 

Obviously, both kinds of studies are needed 
(i.c., those which change the alphabet only 
and those which change content as well). Since 
some series emphasise the stimulating na- 
ture of the materials as well us the alphabet, 
it is suggested that investigators consider the 
use of dcpcndenl variables other than reading 
achievement per se . Tor example, measures of 
personality, inlcrcslsjor attitudes toward read- 
ing and toward school may be more crucial de- 
pendent variables in such studies if it could 
be shown that the mechanics of reading may- 
be mastered as well in i.t.a. as in T.O. 

A furrier problem in matching lies in the 
control of the teacher variable. In some eases, 
investigators have pointed out that requiring 
teachers to teach both traditional orthography 
(T.O.) and i.t.a. would not be possible since 
it is difficult to insist that a teacher shift 
completely from one medium to another. Morale 
would likely suffer and school admirm tralc/s 
are seldom willing to antagonize then staff 
unnecessarily- Downing and Jones (1956) tred 
to control this variable by having each teacher 
spend half her lime with the T.O group and 
half with the i.t.a. jgoup. Their preliminary 
results suggest that this procedure may have 
been unpopular enough to lower the reading 
achievement level of the children. Others have 
attempted to mtteh teachers according to 
length of experience or other comparable 
measures (Sloan, 1966). 

To the extent that teacher altitudes are im- 
portant factors in their effectiveness with 
children (and there are few who would ques- 
tion this assumption), it Is of vital importance 
to provide evidence of adequate control of this 
variable. 

Once again, as i.t.a. becomes increasingly 
popular, strong positions both positive and 
negative, will continue to develop, ft should 
be progressively easier to find groups of teach- 
ers who are deeply committed either for or 
against i.t.a. (ft is rate to find among those 
who are familiar with I.t.a. persons lacking 
strong attitudes.) Teachers with opposing at* 
litodes should be matched in terms of strength 



of feeling ns Well us amount of teaching exper- 
ience and other variables associated with 
teaching competence, ft is possible that there 
me important differences between teachers 
who support i.t.a. and those who do not. Ibis 
problemwould be interesting to study in itself. 

The liaulhornc Effect - The well-known 
Hawthorne Kffeet (that is, the tendency for the 
results of an experiment to be largely deter- 
mined by the fact that the subject knows he is 
participating in an experiment) has been cited 
by those critical of i.t.a. and by almost every 
investigator finding positive results, although 
Dunn, Muller and Neely (1966), for example, 
believe that the Hawthorne Kffeet was u nol 
sufficient to explain the results 0 in theirstudy. 
It is generally assumed that the llawlhorne 
Kffeet has operated in such a way us to mo- 
tivate teachers of the i.t.a. groups to expect 
higher levels of achievement than they would 
normally expect. It should be pointed cut that 
the Hawthorne Effect could have exactly the 
reverse effect in some studies, particularly 
since many reading experts have expressed 
strong negative attitudes toward i.t.a. Tlie 
Hawthorne Effect Is extremely difficult to 
control. If it has not been adeqialely control- 
led in the past, it should be increasingly easy 
to find groups equally committed for and 
against i.t.a. Further, as there is greater ex* 
posure of the public, teachers, and adminis- 
trators to the Initial Teaching Alphabet, what- 
ever novelty effect the alphabet may have 
had in the past shoulJ be a less potent factor 
in future studies. 

Early studies are likely to have been con- 
ducted by those deeply concerned with the 
problem. Some investigators have been con- 
cerned with the preparation of i.t.a. i.iaterials 
while others have been interested in other 
systems,* Myers (1965) served as the teacher 
of the experimental group in her study with a 
colleague smingas the control group teacher. 
While these investigators are both competent 
and ethical, many recent studies have shown 
that studies using lower animals performing 
simple tasks (Kintt, Dclprato, Matter, Per- 
sons and Schappe, 1965; McGuigan, 1963; and 
Rosenthal, 1963) may unconsciously bias the 
results of a study. Kith time, a greater number 
of investigators undoubtedly will become inter- 
ested in research with i.t.a. As a result, there 



* Is xestigntors associated with \t.t.f 
Balter - The freedom to Series 
Downing - The Downing Reader* 
Marurkiewict and Tmiyeet - The 
Earl^to-Retl Series 



Investigators associated with other systems: 
fry - The Diacritical Marking System 
McCracken The Sfcelde* Basic Ret<fing$eries 
Mow tain - Challenge Readers 




36 




should be on increasing amount of evidence 
presented by those who are less psychically 
involved with the alphabet. 

Sample Size - Many studies, particularly 
those assessing the value of i.t.a. for remed* 
ial purposes, have used relatively small sam- 
ples (Baker, 1966; Barclay, 1966; Bosnia and 
Farrow, 1965; Georgiades and Downing, 1961; 
Myers, 1965; and Tanyzcr, 1966a). Some of 
these studies indicate a superiority of the i.t.a. 
groups; others show no difference. In all cases, 
the sample sizes are too small to permit more 
than general suggestions about the relative ef- 
fectiveness of i.t.a. Most of the authors of 
such studies are well aware that their sample 
is of limited size and caution their readers 
about interpretation and over-generalization. 
There is evidence to suggest that, as i.t.a. 
becomes increasingly accepted, inv estigatofs 
in future studies will have larger sample sizes 
available. Such studies should present a more 
adequate base for generalization than from 
any single set of results. 

Statistical Analysis - An integral part of 
any research is the consideration of the sta- 
tistical techniques used to evaluate the quan- 
titati\e aspects of the study. Sta’istical evi- 
dence can be quite compelling. It is a rare 
situation, however, when one meets all of the 
assumptions which underlie the mathematical 
properties of the formulae to be used. As 
one illustration, Tanyzer, Alperl and Sandel 
(1965), in studying over four hundred chil- 
dren in i.t.a. classes as compared with 
four hundred in T.O. classes, used the 
classroom as the unit of analysis. They 
observe that, since the chitdren in a given 
classroom are exposed to a common teacher 
and relatively common experiences, children’s 
scores do not represent individual observa- 
tions. They cite Campbell and Stanley (1963) 
for this rctionale. hhile there is much to be 
said for this argument, there is no attempt to 
consider the effect of the relatively large nuo 
her of children in Tenyrer el a**s (1965) anal- 
ysis, They used a total of thirty-four clasps 
(1? i.t.a. and 17 T.O. classes) with ove- 
twenty children per class. Ihus, their sample 
sire is apparently redact d from over ei |M 
hundred to thirty-four. Since each class had 
over twenty children in it, data from such a 
group might be considered more reliable than 
would be the case if the classes had contained 
only five or six children each. 

Ihis problem is not easy to resolve, since 
students grouped in classes are not inde- 
pendent. However, until there is n generally 
accepted unit of analysis foe research in edu- 
cation, the present author would suggest pre- 



sentation of data using both the individual and 
the class as the unit of analysis. This would 
permit comparison with earlier or subsequent 
investigations using either approach. Knch 
reader might then make his own evaluation o( 
the extent to which he accepts the underlying 
assumptions. 



II. Duration of (he Investigation 

Another important consideration in the eval- 
uation of *%ny investigation is the point in time 
at which final evaluation takes place. Since 
reading is a skill one uses over a lifetime, it 
is theoretical ly possible to evaluate one or 
more aspects of reading resulting from a par- 
ticular method or medium at almost any time. 
Cults has argued, for example, that the evalua- 
tion of i.t.a. cannot be complete until children 
taught with the alphabet reach the fourth or 
fifth grade (Downing, Rose, and Cults, 1961). 
In contrast, most studies have been of one 
year’s duration. The point to be made here is 
that any investigator or sponsor of an investi- 
gation must consider carefully the implica- 
tions and assumptions of whatever tine period 
is studied. Rilh i.t.a* for example, this in- 
cludes such factors as the number of children 
having made the tranaition at one point, the 
amount of instruction end exposure to reading 
materials during the pcr«>d; the other educa- 
tional experiences the experimental and con- 
trol groups have subsequent to the study (i.e M 
dothevtake the same curriculum, and what as- 
sumptions about prior achievement ar^ built 
into this curriculum); and how experienced are 
the teachers in using this new medium. 

One* Year Studies ^ Frequently, for adminis- 
trative reasons, the time for the completion of 
an investigation is arbitrarily fixed at the end 
of the first year. This was true, for example, of 
all of the studies sponsored bv the C.S. Office 
of education (llahn, 1966; Marurkiewlcr, 
1966b; Mountain, 1966; Nemeth and Hayes, 
1966: and Tanyzer, i966b). It is widely ac- 
cepted amongboth the proponents and opponents 
of i.t.a. that at the end of the one year e large 
percentage of children will not have formally 
made the transition to our traditional orthog- 
raphy (usually from twenty-five to fifty per 
cent). Ihe minimum goal of all programs using 
i.t.a. is for the child to be able to read effec- 
tively in traditional orthography* As a result, 
dependent variable tests at the end of one 
year are invariably given in T.O. 3 Since many 
children have not made the transition to T.O., 
a lest in what is for them a new alpha* 



*h)i<n tests are given in the medium in which the 

child was leaning to *ead, I.t.a. pours invari- 
ably perform at a high r level \\m T.O. groups. 



37 



bet puls the i.t.e. group at a serious disnd- 
vantage. Many authors have noted that, despite 
this disadvantage, i.t, a. groups will do at least 
as well as T.O.-trained children. In contrast, 
McCracken (1966b) shows that children taught 
ih T.O. who took a reading test transliterated 
into i.t. a., apparently lose about one year in 
reading level. ITius, one might assume that a 
comparable loss could occur (or i.t. a. children 
although they typically score higher or at the 
same level os the control group. 

Rchn (1963) and Fabino (1963) have sug- 
gested that there is insufficient overlap even 
between different basal series in T.O. to war* 
rant ease in transfer from one to another. In 
their strdies, they were concerned with differ- 
ent variations in controlling vocabulary using 
the same alphabet. It would be interesting to 
study the effect of having children take a lest 
printed in cursive writing as compared with 
conventional print when they had had limited 
training in cursive witting. 

Since none of the proponents of i.t. a. sug- 
gest that there is a specific time at which the 
child muif make the transition, a number of al- 
ternatives suggest themselves. Md>urkiewier 
(1966a) attempted to rcsol\e this problem by 
comparing the performance of the children who 
ha* e made the transition with a matched group 
sample from the control group, ^hilc these 
children were matched on I.Q, , sex and socio- 
economic status.it is possible that these vari- 
ables do not fully reflect their reading ability. 
Ihose who perform best with i.I.a. materials 
and make the transition from i.I.a. to T.O. first 
are among the bfst readers, fchile matching on 
other variables icpresents an attempt to ccn* 
trol this, those in the matched control group 
may not stand as high in reading ability within 
their own group, fihile there is no evidence on 
this point, a more defensible procedure might 
be to compare those who have completed the 
transition with an equal number of T.O. chil- 
dren representing the highest reading scores 
in their gtoup. An alternate procedure, wail- 
ing until m st children have made the transi- 
tion, might provide more representative data. 
However, this procedure suffers from the dis- 
advantage that the slowest children determine 
the point of administration of the dependent 
variable. Ihas, any gains made by the first 
children to transfer may be obscured s»nce the 
advanced i.t.a. group and the T.O. group will 
have been exposed to identical curriculum ex- 
periences for a substantial petiod of time after 
trrr.silion. 

Further, with the rigid on e-iear duration, one 
may question bow much formal instruction in 
reading had actually taken place at the end of 



one year. Much of this period is devoted to 
reading readiness preparation, which may or 
may not involve the use of a special alphabet. 
If the period of exposure to reading instruc- 
tion is loo short, it may be difficult to gener- 
ate differences between any set of methods or 
media. In any event,, it is my contention that 
an /./.a. program cannot be adequately evalu - 
ated in one year. 

Where significant differences have existed 
in favor of the i.t. a. groups, the question has 
been raised “How long will the advantage 
lnst? M (Downing, Rose and Cults, 1961; and 
Vernon, 1965). It occurs to me as it has to 
others, that while some children may learn to 
read with i.t. a. who would not have learned to 
read at all, advantages accruing to the i.I.a. 
group may, in fact, disappear in the later 
grades. Rhile only further research can verify 
this hypothesis, it seems quite reasonable. If 
children in the i.I.a, group ore reading better, 
writing better, and have more positive attitudes 
toward school at the end of a study, unless 
there is a change in the traditional curriculum 
to permit them to make use of this new found 
skill, any differences between i.I.a. and T.O. 
groups must invariably disappear as they con- 
tinue to be exposed to common curricula based 
upon what the T.O. group was assumed to have 
achieved. i.I.a. presents a challenge for re- 
thinking the curricula beyond first grade. If 
there are advantages to i.t, a., we must lake 
advantage of them . 

Tracker Experience - It was noteJ earlier 
that teachers (and perhaps students and p .*ents) 
in i.t. a. studies maybe influenced by the Haw'- 
thof~e Kffecl. It should also be pointed out 
that, in the first year of the study, a teacher 
may not feel completely at ease with the new* 
alphabet, and, in fact, rr>ay not know how to 
deal with its full potential comfortably and 
easily in the classroom. Marurkiewic* (1965) 
has suggested that his research is aehie\in* 
better results in the second year of his study 
as teachers become more comfortable with the 
medium. In general, a classroom teacher has 
had many years of experience in teaching with 
the conventional alphabet. Most teacher-train* 
ing courses in i.t. a. are limited to iwo or three 
days, fchile this may be a sufficient period in 
which to explain the principles of tbe alphabet 
and their use in the classroom, if may not be 
sufficient to develop a confidence and ease of 
working with this new medium which permits 
an adequate exaltation of its ultimate effec- 
tiveness in the hands of skiPed teachers. 
Ihere possible, formal evaluation of an i.I.a. 
program should be delayed until after one year 
of experience with the alphabet. 
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///. Problems uith Criterion Measures 

Perhaps the most important consi deration in 
any study is the criterion measure used. Such 
questions f.s, fthnl do you want to measure? 
How reliable and valid are your measurer? Arc 
your different measures actually measuring J i f- 
ferent traits? Arc your measures contandna ed 
with some extraneous variable?, all must be 
considered. It is upon these considerations 
that one’s final conclusionswill be based. 

Vhat Measure Should He Used? — A great 
many problems are associated with the final 
measures used to evaluate the Initial Teaching 
Alphabet ;n all of the i.I.n. studies conducted 
thus far. First, almost even study has focused 
largely, if not entirely, upon traditional mens* 
ures of reading. Obviously, an alphabet may 
be used not only for reading, but for writing 
as well. Farther, the printed word is widely 
accepted as affecting altitudes and emotions. 
Some evidence already exists that the writing 
of children taught with i.t.a. may be dramatic- 
ally affected (l)owning, 1961: Gardner, 1966; 
Montesi, 1965* and Rcsrek, 1966). Teachers 
have subjectively reported much greater inter- 
est in reading* more positive attitudes toward 
school in general, uid greater independence 
on the part of the children. Stewart (1966) in* 
dicates changes not only in reading achieve- 
ment, but in "independent learning, motiva- 
tion, perseverance, the ability to observ e, and 
tbe ability to write. 0 Similar results are re* 
ported by Rare I ay (1966) working with emotion- 
ally disturbed children. Perhaps it would be 
profitable to include measures in other curric- 
ulum areas (e.g., arithmetic) to evaluate the 
role of i.t.a. in generating attitudes toward 
school. Before accepting or rejecting the pres- 
ent evidence, we clearly need better instru- 
ments than we now have to measure writing 
skilts and attitudes, (f i.t.a. accomplished 
nothing mote than stimulating the development 
of mote adequate measures of children's writ- 
ing. altitudes toward school, attitudes toward 
reading, independent behavior, etc., it will 
have performed a valuable service to the field 
of education. It should make us increasingly 
aware that reading represents only one way in 
which we use our language. Our approach to 
reading will have an impact on a child's de- 
velopment in many spheres. 

The Adequacy o) Reading Measures - A 
crucial consideration of any measure is the de- 
gree of reliability and validity of the instru- 
ments. Mist of the measures used in i.t.a. 
studies thus ?ar have been reasonably reliable. 
In addition, most test publishers indicate that 
their testshave content validity. That is, most 
publishers indicate that the professional per- 



sonnel who have constructed the tesis have, 
in one way or another, surveyed a number ot 
the most widely used texts and have tailored 
their tests to follow the curriculum covered in 
these materials. ITis raises the question, "To 
what extent arc these teals valid for children 
who have learned to rend with i.t.a. materials 
not i^ing a controlled voi obulary?" To the ex- 
tent that the curriculum ns presented in such 
texts is different from the traditional curricu- 
lum, traditional reading tests will not provide 
valid measures, 

An analogy may bo drown between the use 
of i.t.a. in teaching rending and the use of 
'‘Modern Math." In recognition of the change 
in approach to the teaching of mathematics, 
Hat court , Brace and FVorld, Inc. have recently 
published a new instrument in the Stanford 
Achievement Test scries called the '‘Modern 
Mathematics Concepts Tests." In their pro- 
motional literature they note, “Mathematics is 
not new. Tbe old rules still obtain; the rela- 
tionships of quantities, figures, and processes 
have not changed. What has changed is the ap- 
proach to teaching mathematics, the concepts 
taught as a basis for improved understanding 
and application of mathematics." In a cover- 
ing letter with their brochure, the first two 
sentences read, “How is the Modern Math pro- 
gram succeeding? A major need at all stages 
of new curricula programs is objective evalua- 
tion of achievement in terms of new criteria, 
new content, new objectives and methods." 

I raise the question as to whether or not 
tests developed to reflect curricula presented 
in traditional basal reading series are appro- 
priate for use ns dependent variable measures 
in i.t.a. studies. This is of particular impor- 
tance in studies using i.l,a. books which de- 
part from a controlled vocabulary. It is urged 
that traditional reading measures be carefully 
re-examined. Further, where necessary , new* 
tests should be developed which reflect curric- 
ula goals existing between I.t.a. *nd T,0. 
programs and eliminate any variables which 
are unique to either program. 

It should be noted hete that in two surveys 
of the literature to find methods of appraising 
reading abilities and determining which read- 
ing abilities should be measured, Gillmofe 
(1963) concluded that there was no single de- 
vice adequate to measure all reading skills. 
Jhe concluded that much of the evaluation is 
dependent upon the teacher who is in a posi- 
tion to observe the child daily. Future i.t.a* 
studies might include carefully constructed 
rating scales completed by teachers to eval- 
uate the child's progress. Thobum (1966) re- 
ports a study of the subjective evaluations of 




teachers and administrators after using i.t.a. 
She indicates that they feel that substantial 
gains are possible with this medium. Georglades 
and Downing (196-4) and others have reported 
similar results. 

Correlated Measures - One of the common 
methodological problems in research In educa- 
lion end in virtually all of the studies involved 
with the Initial Teaching Alphabet relates to 
the fact that a large number of tests or sub- 
tests are used. Very few of these studies indi- 
cate the degree of Inlercorrelatlon between 
these sub-tests whirh are treated as “inde- 
pendent" measures. It seems highly reason- 
able that spelling ability would he highly cor- 
related with word recognition, reading compre- 
hension and perhaps a number of other reading 
measures. While test manuals generally fail to 
reporleub-lest intercorrelations, Mitchell (1962) 
reports correlations between ^51 • .63 for total 
score on the Metropolitan Achievement Readi- 
nessTestand its sub-scales. Sbe reports aver- 
age sub-test intercorrelalions of about .55. 
Thus, these measures are not independent of 
each other. Furthermore, children in educa- 
tional research studies are taught in the same 
classroom, using the same books, by the same 
teacher. All of these factors tend to produce a 
high conelation between measures which are 
treated as though they were independent. Thus, 
in any given study, if one measure shows a sig- 
nificant difference in favor of the i.t.a. group, 
then it is highly probable that all measures will 
indicate this. Contrariwise, il one measure 
shows r>o significant difference, then it is 
highly probable that no significant differences 
would be leflected on the other measures in 
the study, tn evaluating the results of each 
study, one is tempted to conclude that it has 
shown either that i.t.a. was more effective, or 
that it was generally comparable to T.O. in 
all measures. 

It is strongi; recommended that future in- 
vestigations of the effectiveness of this alpha- 
bet (or for that matter, any method or medium) 
attempt to use measures which are as inde- 
pendent of one anothet as possible. An abso- 
lute minimum requirement should be that the 
investigator indicate the degree of inter-rela- 
tionship between the various dependent vari- 
able measures useJ to permit his readers to 
evaluate his results. 

ContaaunoieJ Mtasuns - Another question 
which occurs to me is “lhat do our conven- 
tional reading tests measure?" Generally, test 
manuals present correlations between their in- 
strument and other publishers' reading tests. 
They seldom indicate the correlation between 
the test and other measures such as intelH* 
gence. 



Strang, McCullough, and Traxlcr (1955) how- 
ever, have cited three studies suggesting that 
the correlation between measures of reading 
and measures of intelligence is between .50 
and .80. Further, Austin, Oush and Huebner 
(1961) show graphically the marked role of in- 
telligence in accounting for reading achieve- 
ment, particularly in the first and aecond 
grades. To the extent that reading achievement 
tests are correlated with l.Q., it is difficult, if 
not impossible, for any method or medium for 
teaching reading to show any real superiority 
over any other method. It is generally assumed 
that l.Q. remains relatively constant. Obvious- 
ly, an adequate stud) of reading ability must 
partial out the factor of Intellectual ability and 
in fact, some investigators have attempted to 
do this (Shapiro, 1966). Yet, if LQ. accounts 
for a targe proportion of the variance of read- 
ing scores, one is left with the question of 
what is left in traditional reading measures 
when l.Q. is removed?Rh$it evidence of valid- 
ity do the tests indicate under these condi- 
tions? Once again, if lalsing these questions 
in the evaluation of the Initial Teaching Alpha- 
bet serves to promote an interest and effort 
toward developing more adequate measures of 
a child's reading ability independent of his 
intellectual level, then i.t.a. will have per- 
formed an important service to the field of ed- 
ucation. 

Summary and Conclusions 

Despite the research errors (and there have 
been few educations) methods, media or ma- 
terials which have been as intensively studied 
as i.t.a.) some general conclusions may be 
reached even now. One source of resistance to 
i.t.a. has been with regard to the potential 
"risks" involved. These are expressed largely 
in terms of the belief that children will not be 
able to make the transition. That they will not 
be able to spell in T.O That in the long run 
they will be less well off than T.O.-taught 
children. The evidence suggests clearly that 
this is not the case. Children laudit *ith i.t.a. 
do <W least as iref/ as T.O.-taught children ~ 
using T.O. standards for evaluation* This de- 
spite ;he fact that relatively Iwge percentages 
of children are tested with ax alphabet which 
they have not been reading* Downing and Jones 
0965) foeexample t report that <>nly of their 
four- and five-) ear old children had made the 
transition at the time of testing, yet the total 
group did not score significantly different from 
cftit<ben in the control gyoup. 

Expressed from a more positive pout ol 
vieW'Onecan point to a number of studies ten* 
ducted by different investigators which present 
quite positive results using both objective and 
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subjective data. The question for the adminis- 
trator should become “If there seems to be 
little risl^! can I afford to ignore the potential 
gain suggested by i.t.a. research?" 

Ther<» have been many methodological errors 
in studies attempting to evaluate the effec- 
tivcncssof i.t.a., but to no greater degree than 
wo usually find in educational research. Some 
of these errors may have had the effect of mak* 
ing i.t.a. appear more effective than it is, 
while It is obvious that many of these errors 
have served to reduce the apparent effective* 
ness of the alphabet. Some of the problems as* 
sociated with the early studies will be easily 
resolved in future studies. With greater inter* 
est and experience with i.t.a., the Hawthorne 
Effect should be reduced to a negligible de- 
gree by now. The influence of publicity, if it 
has had any adverse effects, should he less 
potent. It should also be possible to obtain 
larger and mo f c adequately equated groups. 
Further, with greater interest among diverse 
groups* control of the teacher variable should 
be easier, and with an increasing number of 
studies, the role of the attitudes of the inves- 
tigator should come under control. 

It should be emphasised that these controls 
will not come automatically. It is simply that 
we should be better able to control them once 
we are aware of their influence and have a 
wider range of school systems, teachers, and 
investigators interested in determining the ef* 
fectiveness of i.t.a. 

One obvious conclusion is that studies may 
be of varylngduration, but in any event, greater 
than one year. Further, investigators must con- 
sider the effects of other aspects of the cur- 
riculum on the lime period they select, aryl 
select measures appropriate to the duration se- 
lected (e.g., measures of simple reading skills, 
attitudes toward reading, etc.). 

This Iasi point raises one of the moslpress- 
ing issues in educational research - the need 
(or more adequate meas*tes. We need measures 
of other factors related directly or indirectly 
to reading incl«ding # aUitudes toward reading 
and school in general, writing ability, etc. Fur- 
ther, we must take a hand look at our reading 
measures* Are the sub-tests measuring differ- 
ent characteristics? What are we measuring 
besides I.Q.? Can we control for t.Q. and still 
have meaningful measures? Belter instrumen- 
tation lies at the heart of more adequate re- 
search in education. 

t.r.o. StwJas - This paper has sug- 
gested that it would be profitable to replicate 
many of the early studie i using more adequate 
controls. Future studies must obviously be 
longer than most of the early ones and investi- 



gate a wider range of relatively independent 
phenomena. Yet, there arc some special con- 
siderations future studies must deal with. We 
need much more research in the area of special 
education and in the remedial use of i.t.a. We 
need to know more about its effectiveness in 
teaching English as a second language. We 
need to know*, not only what happens when the 
transition has taken place, but what, if any, 
advantage there might be tn prolonging ex- 
posure to i.t.a. in an effort to build increased 
confidence. We need to know whnt method or 
methods work most effectively with the Initial 
Teaching Alphabet. While the alphabet does 
not demand that a teacher utilize any particu- 
lar method, it Is possible that onr approach Is 
more effective than another. Wo have sprnl a 
great many years aitempHng to learn what 
teaching methods may be used to greatest ad- 
vantage with our conventional alphabet. We 
should undertake similar investigations with 
i.t.a. We should leamhow to use it; who should 
use it; when it should be used; how long It 
should be used, etc. 

It seems reasonable to assume that I.t.a. (or 
for that matter, any educational Innovation) will 
never be universally used no matter how effec- 
tive it may be. Many minds are already closed 
to the issue. Some are convinced that the 
Initial Teaching Alphabet Is the solution for 
which they have been watting. Others are 
equally convinced that it does not work Some 
argue that it obviously works and should be 
adopted. Some argue that It should not be ac* 
cepled too quickly - others are arguing it 
should be rejected immediately. 

This f*per has attempted to suggest some 
improvements in design. It has suggested not 
only that we can do a more effective job In eval- 
uating I.t.a., but that attempts to provide 
proper experimental controls inevitably will 
force investigators to re-examine many basic 
concepts in the field of educational research 
and evaluation. 
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The Fourth International i.t.a. 
Conference 
will be held at 

McGill University, Montreal, Canada 
August 1967 

The winter issue of the i.t.a. Foundation 
Report will contain o general 
outline of the program and 
forms for registration. 

Individuals Interested in presenting 
papers at the Conference should 
contact the Foundation as 
early as possible Indicating 
the nature of the proposed 

presentation. See p. 27 




